Design and evaluation of a TOP100 Linux Super Cluster system

نویسندگان

  • Niklas Edmundsson
  • Erik Elmroth
  • Bo Kågström
  • Markus Mårtensson
  • Mats Nylén
  • Åke Sandgren
  • Mattias Wadenstein
چکیده

The HPC2N Super Cluster is a truly self-made high-performance Linux cluster with 240 AMD processors in 120 dual nodes, interconnected with a high-bandwidth, low-latency SCI network. This contribution describes the hardware selected for the system, the work needed to build it, important software issues, and an extensive performance analysis. The performance is evaluated using a number of state-of-the-art benchmarks and software, including STREAM, Pallas MPI, the Atlas DGEMM, High Performance Linpack, and NAS Parallel benchmarks. Using these benchmarks we first determine the raw memory bandwidth and network characteristics; the practical peak performance of a single CPU, a single dualnode, and the complete 240-processor system; and investigate the parallel performance for non-optimized dusty-deck Fortran applications. In summary, this $500K system is extremely cost-effective and shows the performance one would expect of a large-scale supercomputing system with distributed memory architecture. According to the TOP500 list of June 2002, this cluster was the 94th fastest computer in the world. It is now fully operational and stable as the main computing facility at HPC2N. The system’s utilization figures exceed 90%, i.e., all 240 processors are on average utilized over 90% of the time, 24 hours a day, seven days a week.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

System Engineering Implementation Process for Super-Systems

System engineering is one of the most powerful tools for comprehensive project management and control. This tool emphasized the life cycle of the projects, manages every single activity and helps manage the main elements of the project through a set of management and engineering processes. The goal of the current study is to use a system engineering approach in design phase in order or to meet ...

متن کامل

Building a Large Scalable Internet Superserver for Academic Services with Linux Cluster Technology

With the speed and bandwidth offered by the next generation Internet technology, there is a need for large and scalable Internet server that can provides an adequate computing power and storage for the new generation Internet applications. This requires a huge investment in a very large and expensive commercial server system. Recently, the emergence of Linux PC clustering or so-called Beowulf C...

متن کامل

Parallel computing using MPI and OpenMP on self-configured platform, UMZHPC.

Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...

متن کامل

Conceptual design of a super-critical CO2 brayton cycle based on stack waste heat recovery for shazand power plant in Iran

Conceptual design of a waste heat recovery cycle is carried out in attempt to enhance the thermal efficiency of a steam power plant. In the recovery system, super-critical an CO2 is employed as the working fluid operating in a Brayton cycle. Low grade heat rejected by the flue gases through the stack is used as the primary heat source, while a secondary heat exchanger utilizes th...

متن کامل

Application Performance of a Linux Cluster Using Converse

Clusters of PCs are an attractive platform for parallel applications because of their cost effectiveness. We have implemented an interoperable runtime system called Converse on a cluster of Linux PCs connected by an inexpensive switched Fast Ethernet. This paper presents our implementation and its performance evaluation. We consider the question of the performance impact of using inexpensive co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Concurrency - Practice and Experience

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2004